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Assessing the Need: 
What Information and Activities Should We Preserve?’ 


Michael L. Miller” 


Abstract 

Archivists, records managers, librarians, and technology advocates have developed visions of what 
electronic records could and perhaps even should be preserved for various purposes. This paper looks ar 
the issue from @ very pragmatic point of view of someone who has worked in both a Federal agency that 
produces and disseminates huge quantities of information and an archival program thar has attempted to 
deal with the flood of Federal records for roughly a quarter of a century. The author examines some 
questions underlying the title of this paper with the goal of highlighting areas of potential agreement and 
disagreement on how the issue of preservation should be addressed. The author argues that we need to 
look to limit the amount of documentation we seak to retain over time and match our preservation plans 
to our management Capabilities at this time. . 


Introduction 

The raquest to write this paper brings to mind two stories. The first is about myself. When I was in the 
railitary back in the late 1960s, I bought a (for the time) very elaborate 35mm camera, with multiple 
lenses, filters, external flash and lots of other gadgets. As was typical of cameras of that time, this 
“fancy” camera did not have automatic light metering and focusing. After a few years of taking 
“artistic” photos, T found myself using the camera primarily for svapshots of friends, a task for which it 
was unsuited (in my hands anyway). I eventually bought a much more modest 35mm camera that wes 
entirely avtomatic and fit my needs very well. 


The second story Is much older, one of Aesop’s Fables, the story of A Boy and the Filbests.3 


“A boy put his hand into a jar of Filberts and grasped ag many as his fist could possibly hold. 
But when he tried to pull it out again, he found he couldn't do so, for the neck of the jar was too 
small to allow of the passage of so large a handful. Unwilling to lose his nuts, but unable to 
withdraw his hand, he burst into tears. A bystander, who saw where the trouble lay, said to him, 
“Come my boy, don’t be so greedy: be content with half the amount, and you'll be able to get 


your hand out without difficulty.” 
Do not amempt too much at once.” 


What ties these two together in my mind is the need to think through very clearly exactly what we need 
to preserve and why, when it comes to electronic records. The question in the title of this paper is not 
new - it is at the core of the archival discipline. [n its most basic form, the question of what we should 
preserve is one of detennining value - value to whom, for what purpose(s) and for how long. Some 
would say it is the most critical question facing archivists, and most would agree that at least it is the 
most fun to debate. The volumes of literacure on the subjest bear witness fo that fact.4 This paper is not 
meant to resolve the ongoing debate, but rather to frame the debate and identify some of the critical 
issues from the perspective of someone with both records management and archival backgrounds. 
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The question “What Information and Activities Should We Preserve?” seems simple enough, but the 
underlying issues are far more complex. J see three basic sets of questions: 


Wheat is the “what” that we are considering preserving? 
Whar do we mean by “precatve” in this context? 
Who mukes up the “we that will be doing the preservation? 


The format of this paper will be to look at cach of these themes in detail, raising a series of questions 
about the theme and then offering a position from the paint of view of this archivist/records manager, 
Hopefully this will serve as the Springboard for broader diseussions. 


question has always been how are the documents selected, and by whom, and for what purposes. Do 
electronic records resolve the necd to he selective? 


In my view, selection will be necessary in the future, The only question is how that selection will bs 
accoruplished, Selection of documentation for preservation can be based on some definition of value - 
for organizational needs, specified purposes, potentis! or actual use, or other considerations. 
Preservation can also be based around management and cost issues so that older information is preserved 
only as long as it does not require extensive, or in some cases, any additional resources. In many cases, 
Preservation is in fact random, especially the preservation of paper records, which can be “preserved™ 
because people neglect them by throwing them into an attic or basement where they are discovered 
decades, even cenmuries later. Fortunately or unfortunately, the last scenario “preservation through 


neglect” is not an option for electronic reconds, making some agreement on what to preserve even more 
important than it was in the past. 


Examining the “What” 

Margaret Hedstrom subtitled one of her many articles on electronic records issues “Deciding What is 
Essential and Imagining What is Possible.”© The challenge of imagining what is possible and deciding 
what is essential is crucial to all areas of electronic information, but especially when it comes to laying 
out the potential types of activities for documentation. It it here that we need to be sure to got beyond 
Terry Cook’s warming about the problem of using paper minde to oveluate electronic records.7 


The “what” question can be subdivided into two related questions, The first question is the “imagining 
what is possible part” and the second is the “deciding what is essential” parr. 

What are the basic types of documentation that might be preserved and why? 

What qualitice and functionality of that documentation warrant Preservation and for what 
Purposes? 


The question that is not included is what specific activities deserve 10 be documented. That is a 
deliberate omission. Whether, in the grand scheme of things, AIDS research is more worthy of 
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documentation than drag racing is not a question to be addressed in a general paper. AJI activities have 
their champions who can explain better than I why the record of their activity deserves to be preserved. 
My goal here is to lay out a framework for deciding what typcs of documentation to keep to meet your 
neods assuming that you have decided to document the activity. 


The following Is a brief list of the types of documentation that should be considered for preservation, at 
Teast in the “imagining” stage. In some cases the categorics overlap, but the goal here js ensuring breadth 
of coverage not mxonomic exactitude. The terminology may smack of archives-centrieness, but 
hopefully the descriptions wil! allow al) information professionals® to recognize the types of 
Gocumentation under discussion. These categories are not presented because I think that they all need 


be peesecved, but because each has been suggesred for preservation at one time or another for certain 


pal © = ated 


: by External Audiences, This category would include any type of 
elecrronic product meant for use as an “information resource” by an audience external to the creating 
organization. Such final products may be only in electronic form or may be available in @ variety of 
media. Examples include newspapers, books, reports, catalogs, data products such as Census files, final 
processed observational data, digital photographs, dara products released ac part of scientific 
collaboration, and Web pages. The basic characteristics of all of these items is that they are final, 
finished products meant primarily for use by others and not as part of documentation of the creating 
organization's regular day-to-day activities.10 The primary goal of these produets is 10 disseminate 


Final Documentation Needed as Evidence ofa Transaction, Activity, or Program. This category would 
include final, finished documentation of the organization's activities and programs. They are 
distinguished from the first category because their primary purpose is to conduct a “business transaction” 
hot to simply provide information to an extemal audience (although they may do that as well). The 
documentation may consist of a single document, a series of documents (file), or an entire recordkeeping 
system. Examples would include letters to constituents, a will or deed, an international treaty, bills for 
services, an electronic commerce interchange, a court decision, the records ofa court case, or the 
administrative record documenting the development of a government regulation. The key tharacteristics 
of such documents are that they are used primarily to accomplish a business purpose, aré relied upon by 
the organization as the documentation of their activity. Generally speaking, the organization or parties 
imvolved determine what constitutes the “official record” of the transaction based on law, regulation, or 
best practices. Once created, the record needs to be retained primarily as evidence that the transaction 
took place and was properly carried out. 


Although these two products differ by purpose and intended audience they share e number of 
characteristics. First, they are the “official positions” or statements of the creating organization. This 
makes them useful for some types of inquiries (¢.g., what wes the Supreme Court decision in a specific 
case), but less useful for others (c.g., how did the court really arrive at the decision). Second, as the 
official positions of the organizations, they may well deserve to be preserved, even after they have been 
superseded. Third, both types of products sit atop a pyramid of other documentation thet supports them. 


Administrative Data, Consists of data created and collected by an organization to carry out its functions 
and activities. Such data will serve as a major information resources for the organization and will also be 
used for analytica) purposes and to develop products discussed above. However, despite the variety of 


3 


oa 
004/012 


06/07 13:23 FAX 3018702765 HISTORY ASSOC 
06/97 12:49 e NARA 
Draft of 2/6/97 


tises to which they are put, all of these databases have an ongoing organizational functionality that 
requires thar they be maintained over time to meet the business needs of the orgenization. Examples 
fnglude budget, personnel, payroll, and other administrative sycteras; systems to track accounts, systems 
to track and manage programmatic functions such as the issuance of permits, systems to track the 
lasuance of passports, and other types of systems that document how a specific organizational activity is 
managed. Tn their active life these systems support the work of the cfeating organizations, but they may 
be of Interest to those researching specific areas documented by the system. 


Software This caregory includes off-the-shelf commercial software, commercial software as modified 
by the organization to meet organizational needs, custom designed software, expert systems, computer 
models, and other software used by the organization. A record of software has two purposes, First, ic 
allows information that is in a software dependent format to be need, Second, a knowledge of software 
used by an organization and the capabilities and limitations of the software is critical to understanding 
the abilities and inabilities of users to accoruplish certain tasks, Some examples of how information 
about software and how it works may help. First, it has been the policy of the Environmental Protect 
Agency (EPA) and other Federal agencies to print to paper any e-mail mexsage qualifying as a Federal 
record. To properly evaluate the resulting paper copies, it is necessary to know that the system printed 
the date the message was originally created, not the date it was sent, even when the two datcs were 
different. Ata higher level, a copy of the expert system used by EPA permit writers to develop permit 
documentation will reveal a lot about the process used to issue permits and why some applications would 
be denied. Finally a record of the changes fo the massive modeling system used to produce the national 
economic models or to the system used to interpolate data in Census studies will be crucial to 
understanding the validity of the data. 


Explanptory Material Negded to Understand the Final Products. Electronic documents, information, or 
objects frequently are not self-explanatory. To use or understand them generally requires additional 
information that varies by the type of electronic record under discussion. For darabases, one needs 
information about data elements and codes for example, while metadata sbout how and why a record was 
created is fundamental to understanding its role in the activity it documents. To be truly useful, scientific 
data needs contextual information about its accuracy, the instrumentation used to collect it, and data 
limitations. Bottom lins is that to fully understand any document (electronic or otherwise) there must be 
sufficient explanatory material retained with it. 


Source Materials. This category includes those materials that were mined to produce a final product Jt 
may include raw data that is later corrected, edited, and enhanced or finalized; databases from which 
subsets are extracted; geographic information system (GIS) data layers used to create specific products; 
and a data warehouse used to conduct and analysis, historica) data used for projections, and data usad in 
computer modeling. !! 


Product Development Materials. This category includes the documentation created during the 
development of a final product, but not required or judged to be part of the final product or official 
documentation. Examples Include notes, working files, drafts, recards of receipt of electronic mail 
messages, and other documents created during the business process. In the case of scientific data, it 
might Include the results of experiments, measurements, Of testing, nat required to support publication of 
the results. In socio-economic analysis it might include data analyeas, the results of computer modeling, 
and the data used to test hypotheses. In land use studies, it might be the results of GIS runs used to 
identify potential sites for a waste fecility. Although these materials were judged not to be necessary for 
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the final documentation of the activity in question, they were consciously created as part of the 
Gevelopment process, and may provide valuable information about how a specific activity was carried 
out. 


Data Management Information. This category covers the documentation created by the computer system 
or application independent of the will of the individual user. In document création it would include the 
basic information created on document size, initial creation date, the number of editing sessions, a record 
of changes made to the document during the session; and other Information captured automatically. In 
developing a GIS product it is the record of how the actual processing took place. In the case of EDI 
transmissions it is the logs of the various translations, transfers, receipts, and other stages in the 
transmission and processing of the EDI transaction. Relating to the Web, it inchides such diverse records 
as the record of the session and the sites visited, the recards of whar information was available on the 
Web at a particular time, and information captured about thase visiting the Web site. In database 
applications ic includes audit trails, records of who accessed the system, and changes to the data. 
Systems management information would include information abour aystems ysers, passwords, log-ons, 
and other information used to manage the systam itself. . 


Tying all of these different types of information together is the fact that they were created to aid in the 
management of computer systems and the reconstruction of data in case of system failure. However, all 
Of them can be used to document various aspects of an individual user's activities, and may, in some 
cases, actually shed light on how @ transaction occurred or an activity was completed As an example, 
proving the integrity of s recordkesping system might involve keeping a record of access Io the Fystem, 
the passwords and PINs of persons having access, and any changes made to the date 


Informal Electronic Communications. Much e-mail traffic consists of sharing of information and views, 
exploring areas of common interest, and persona] communications of all sorts, For many. email has 
yeplaced both the telephone and the letter as the preferred way to communicate with peers, family, and 
friends. One nead only think of the variety of listservs, char rooms, personal group lists, and informal 
collegue-to-collegue communications to understand that there ia a wealth of communication going on 
that may be business related that is not linked to a specific “business transaction” or “product.” Even so, 
these communications may provide tremendous insight into the workings and cultures of many segments 
of our society. The informal and “private” narure of the medium means that many people are far freer in 
expressing themselves ovcr e-mail than they are in other media. Although there are a series of issues 
relating to the privacy of persons involved in such exchanges, those who want to capture the flavor of the 
late 20th century will find them a wealth of information about haw we live and work. }2 


What Dors It Mean to Preserve? 

Preservation of electronic information has had a checkered history to this point. Those trying to preserve 
information clectronically have had to deal with evolving hardware and software, fragile media, and the 

need to maintain not only the information iteelf, but the documentation and software necessary to support 

it. Whether the preservation is done by data libraries, data centors, or archives, it has presented 

problems. Jt js this history of limited success in maintaining electronic records that serves as the context 

for my remarks. While the future is brighter in this area, the present offers significant challenges that 

make me wary of attempting too much. , 
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Tt is my firm belief that the best documentation is the result ofa clearly bounded understanding af the 
entity (¢.g., organization, event, city, phenomenon, parson, etc.) to be documented and a firm vision of 
what it js that is to be documented, and the level of detail in documentation that is sufficient, In 
govemment archtves, the current formulation of this theory is known as functional appraisal, where the 
archivist looks at the major functional activities of an agency (6.g., issuing permits for pesticide uses) 
and determines what documentation should be preserved to document that activity. The short hand 
version of the theory is that the erchivist has a better chance of preserving # useful record for the funire if 
s/he simply selects the documentation of what the organization does that is important In its view, rather 
than trying to determine what records might be useful in the furure based on the types of information 
they contain. There are oxceptions, but this is the general rule,!3 


For example, the Environmental Protection Agency (EPA) creates extensive files concerning cach 
pesticides, its formulation, product testing, potential risks, and other aspects that are taken into 
consideration in reviewing the application. They were appraised bared on the documentation-they 
provide of how the Agency carried out an extremely sensitive funetion, as the strawberry growers in 
Califoria know full well. However, historians of agriculture, science, Jabor, and issues that we haven't 
even thought of yet, may find extensive information about their subjects in these files. 


This is not always an easy task. The National Archives and Records Administration (NARA) has 
developed its bounded aren of concer - records of Federal agencicg that constitute essential evidence, 
What that means in concrete terms is being investigated now.!4 This does however raise another bias of 
mine, which is [ don't believe in saving everything. It can be argued that every type of documentation 
might be useful in resolving some issue. In my mind, that does not justify keeping everything and letting 
technology allow us to store it and search it. Storage is not the major issue - but information 
management, ongoing access in the midst of changing technologies, and ability to comprehend what has 
been stored are very real issues. Therefore I would argue that we need a process to select what we will 
retain, 


The first step in understanding the preservation issue is to look at why we might seek to preserve 
information. I offer a few of the many potential uses of the information we preserve. arranged into what 
I percelve as levels of system functionality to sorve those needs, Level I is the simplest and Level Lif the 
most difficult. 


Level I: Provide information 

Provide answers to particular questions. 

Provide raw material for research 

Provide a historical “record” of what took place (corporate memory). 
Level TT: Serve as the official, Jegal record of activities 

Support the ongoing operation of the business 

Serve as the legal record of the transaction or a portion of it. 

Permit oversight and support accountability : 
Level Tif: Allow for the recreation of the transaction to check validity of results 


T see each of these three levels as being progressively mors difficult and expensive to maintain. { would 
also argue that, as time goes on, the primery uses of the information gradually move to Level I uses, and 
the number of users that have an interest in Level I and TI] uses will decline. Therefore we should be 
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thinking in terms Of a strategy that gradually reduces the functionality of the systems as well as the 
amount of information we preserve to keep them in line with research needs. 
This viewpoint is not universally accepted. A major reason is that it limits accountability and overs 
- the extent that we think in terms of official accountability for activities, a "staruts cf taittatinea ol 
applied that limits the length of time that records need to be maintained for accountability purposes. 
This “predictability” about the uses of the records is essential to their management. For example, the 
EPA limits the length of time it can require the regulated community to produce records required under 
regulations. This allows the regulated companies the ability to manage their records. Yetforthose 
ivestigating wrongdaing - corporate knowledge of health risks from smoking for example - always want 
as much material maintained as possible forever, or until all of the issues have been resolved. This 
perspective finds all information potentially useful and would require the retention of al] information for 
extended periods of time to ensure thar it will be available “just in case.”!5 This desire runs counter to 
the need to rationally manage one’s records and information resources. = 


It the final product sits upon a pyramid of supporting documentation, and the final product is an obvious 
candidate for preservation, the question becomes what supporting documentation (if any) sbould be 
maintained, for what period of time, in whet format, and for what purpose. Each of these four aspects 
deserve detailed examination. Let me say at the outset that I would argue that there are instances when 
each of these documentation sources is worth preserving at least from a theoretical point of view, I 
would also argue that, in the vast majority of cases, the final products of the activities will serve the 
majotity of research uses very well, especially as time goes on. The basic issue is to develop guidelines 
as to what the minimum level of necessary documentation might be, Those whose job it is to preserve 
information and/or records should not be deluded into thinking that what constitutes appropriate and 


indeed necessary documentation in some special cases should become the standard for documenting al) 
activities. 


The first question then js the level of richness in documentation that we wish to maintain. The most 
basic question concerning electronic records and the level of richness to be preserved, is whether we sse 
electronic records us a way of preserving more information about more activities or whether we sec it as 
a Way to preserve the same level of documentation in a format that is easier to access and use. The more 
we wish to preserve electronically the greater the menagement problems. 


Technically, the maintenance of final products is relatively easy, compared with the maintenance of 
supporting materials. The document creator specifically wanted to see the final products preserved, at 
least for a time, so these items are better documented and managed than the materials used to create 
them. Supporting documents may provide crucial information about the evolution of a document for 
example, or they may be a jumble of information thet is difficult if not impossible to understand as 
comext The maintenance of supporting documentation works best when the period of preservation is 
relatively short and the scope of the attempt is relatively narrow. At thié point in the evolution of 
electronic records our ability to identify and link all the supporting materials for a specific activity is 
limited m practice by two practical problems. Programs wishing to preserve supporting documents on a 
regular basis will have to deal with software thet is geared toward allowing maximum capability to 
change and update documents and informetion, and records crestors who are not used to thinking of the 
preliminary documents es records that necd to be managed. ’ 
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aad 9 eager al Soncems the length of time the documentation should be preserved. Archivists 
prestevatinn nie ae Gage " records, but now speak of records of continuing value. In effect 
indoeitiorierdoc & moment of creation and it is simply a question of how long specific 
beg. cuments need to be retained - how long do they have continuing value?!6 Preservation 

cigar Vicwict protess where the level of documenuttion will gradually diminish over time based 
infers saarataee 1s bed It Is very necessary to clearly lay out for the specific activities and classes of 

oe. Mumanea y 13 to be preserved and for bow long. It is quite possible to agree that for some 
aorivt - ly full documentation should be preserved for a specific period of ume - several years 
perhaps, with the idea that unless there is some specific issue, a major investigation or problem with an 
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mel? & newspaper story, the documentation will be reduced in complexity after a spesific period of 


The third question concems the Jevel of functionality that should be retained over rime. In the case of 
ward processing documents, must we preserve the capability to modify those documents, a capability 
(functionality) that the creator would have bad? If we need to retain records of system audits until the 
transactions of a system have been audited, do we need to maintain them after that, or is a simple record 
of the findings of the audit sufficient? For example, it is possible to argue thet for a period of time, 
tescarchers should be able to easily replicate the analysis done on a GIS in support of the siting af a 
waste-water treatment plant for example. However, preserving the complete functionality of the system 
rather than simply the resuhs of the analysis can be difficult, especially if it must be maintained across 
system migrations, This is why the idea of gradually diminishing levels of documentation has always 
made sense. Yes, there would be full functionality to recreate the analysis for a while, because the actual 
decision may actually be in doubt for a while or subject to litigation. But given the costs of maintaining 
that functionality, it would be allowed to diminish over rime as the need to document what actually took 
place and why, superseded the need to be abic to second gucss decision makers. 


Another commonly debated question of functionality is how we go about preserving documents. At one 
end of the scale we have ASCTI text, which is easy to preserve, but does not capture the look and 
structure of the documents. A second svategy might be to use SGML as a standard and convert 
document to that (or HTML) for preservation. A third option would be to preserve the documents in a 
read only format using a viewer, or imaging the documents to preserve their format. A fourth is to ask 
whether electronic media are the best way to preserve records or outdated information over time, For 
documents that are not active and are difficult to maintain (older expert systems for example) does it ever 
make sense to preserve what is needed in paper form? Perhaps only the manuals explaining how the 
systern was used would be sufficient. Even iF there was a need to maintain électronic access, the manuals 
would be easier to maintain electronically that the systems themselves. Are all of these approaches viable 
options, depending on the needs of the users? Is it better to Jeave everyone to develop their own strategy 
or to try to set guidelines on when each is most applicable? 


Finally, we rerum to the question of purpose - why are we retaining these records. To the extent that we 

are attempting to preserve information that people can use to answer questions, there are a number of 
alternatives that would simplify preservation. A creating organization may have to retain extensive 
“overhead” to prove thar the electronic records on their system are indeed the authentic records of the 
organization and are trustworthy in their content. But once the records have sérved their businass 

function, is it still necessary to preserve the full “recordkeeping functionality” of the system? As an ‘ 
example, members of the regulated community producing reformulated gasoline can submit their 

mandatory reports via electronic data interchange (ED). These submissions can be used for civil and 
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criminal prosecution so the ability to manage them as officia] records and ensure thelr complete accuracy 
is very important. However, the EPA can only initiate criminal and civil penalties for o specific period 
of time ~ generally seven years. After that, the submissions stil] have value to those swdying the 
reformulated gas industry and developing new regulations, but they cannot be used for prosecution. 
After the S¢ven years, it makes sense to remove them from the recordkeeping system, and maintain them 
in a deta library with proper security to ensure that the data were not changed. 


To sum up, preservation should be seen as an ongoing process, not only in terms of the need to perform it 
on an ongoing basis, but also because it requires constant reevaluation of what level of richness of 
information and functionality we need to preserve. It becomes a question of resources. The simpler the 
materials we choose to preserve and the more standard the formats, the easier our jobs will be. Given 
limited resources, we nead to develop a clear set of priorities as to what actually should be preserved. 
These priorities should be based on predictable use of the records and a clear understanding of their 
value to particular constituencies for particular purposes, . 


Who Is the “We” That Will Do the Prescrvation? . 

This question is the subjuct of a later paper, but to explain my approach tn the preservation question, I 
need to at least pose my position on this mamter. The “we” should be considered extremely broadly at 
least in the firet instance. It should include records creators, data and traditional libraries, and archives of 
all descriptions. This broad-brush approach has two major advantages and one major disadvantage. The 
first advantage is a philosophical one. A diverse group of preservationists with differing interests and 
values for documentation will result in diverse things being saved, at least initially. A preservation 
program with a well-defined focus and clear boundaries will have « berter chance of preserving the 
richness of events than an institutian with an extremely broad preservation mandate. 


The second is a practical one. [f the program preserving documentation in clectronic farm can focus ona 
specific defined area, it has a better chance to resolve the technical isrucs inherent in any electronic 
records program. An organization that has a single e-mail system can devise a system-specific means of 
preserving the messages from that system more easily that NARA can devise a system for preserving the 
products of multiple systems. 


The rather obvious disadvantage is that there is no guarantec that many of the “preservationist” programe 
will last themselves. As a result documentation may well be lost. This is carainly true, but the final 
result may not be any different than if preservation were left to 2 small select group of programs. {f 
anything, the approach buys time, which in preservation is a major advantage. Theoretically, additional 
time may help sort out what documentation is actually necessary. My own view is that preservation, 
even archival preservation, is for the length of time the documentation continues to have valuc. 

Therefore I assume thar the value of certain types of documentation currently preserved will diminish 
over time, resulting in certain records being discarded in any case.!8 From a technical perspective, the 
additional time allows for new approaches for preservmg electronic records to evalve. 


However, we must be aware that records crcafors are usually not in the preservation business. They have 

their own work to accomplish and the records and information they generate are a by-product of that 

activity. They will preserve the recards and Information as long as it serves their purposes and the 

preservation does not entail significant costs. However, creating groups may balk at the expense of 

migrating older records no longer used to a ncw hardware or software platform. Records creators have a 
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difficult time understandi sa 

information system whi se. whry it ig necessary to preserve a record of the limited functionality of an old 
stages of the otieeeuien we ie to anew onc. Tn sum, records creators may help for the early 
specialists in preserevation, Jong-term preservation has bean and will remain the work of 


Cenclusion 


This paper has attempted to | out some of is F 38 , " 
aud polividies should ay the major choices we face in determining what information 


be preserved to document the digital age. I um drawn back to the boy in Aesop’s 
fable who had to give up some of the Fllbertethiiwere wine grasp in sarbirjptshiti fy 1 find that 
msny of us are cnamored of what the technology can do - if applied as the vendors describe, However, 
many parts of our culture are far behind the cutting edge of mehnology. Partly it is because records 
creators do nat choose to implement many of the options available to them, because they don't see the 
need. Even when they do, the effectiveness of the technology solutions still stands or falls on the 
capability and dedication of the people implementing it. As only one example, it is cleer that records 
creators will Iry to “outwit” records management software by entering insufficient indexing and metadata 
if they do not see the need for it or do not claarly understand how the system works. ‘i 


The parameters of the debate are bounded on one side by the ever-expanding potential of the technology 
and on the other by the much more slowly expanding capabilities of the people employing the 
technologies. The question comes down ro not what should we preserve, but what can we feasibly 
manage ef this point, Manage means two things, First is to physically preserve the information and 
records in a technical sense, ensuring that it {5 accessible despite the passage of time and changing 
technology. This means an ongoing program of preservation. But management also means that the 
information and documentation has the context to make it understandable as well ag accessible. Today 
much of the context for electronic records ig provided by the individuals who created them and worked 
with them. We will need w get beyond that limitation if we are going to be successful in preserving, 
electronic records over the long haul. 


ee Ee 


Endnotes 


1 This paper was prepared for the conference “Documenting the Digital Age,” February 12-15, 1997 and was 
submiqed for pre-conference review on January 21, 1997, Due to the short lead timc Lo prepare the paper for 
review, the author will make a second version available at the conference. 


2 Michas! L, Miller beceme the Program Director for records Management at the National Archives and records 
Administration on Janvary 6, 1997, From 1990 through 1996, he served as the National Program Manager for 
recards and Agency Records Officer for the Environmental Protection Agency. From 1976-1990 was an archivist 
‘with several units in the Netional Archives inoluding the [then] Machine-readable Records Branch and the Records 
Appraisal end Disposition Division. He Is a frequent speaker on records management, appraisal, and electronic 
records issues, and has published a number of articles in the field, Mike received a Ph.D. in history form the Ohio 
State University in 1980, 
3V.5, Vernon Jones, trans. Aésop’s Fables. (New York: Avena! Books, 1912) p. 61, 
4 The bibliography on the subject of appraisal of electronic recards Is immense. It ia probably the most writtan 
about aspect of clectronic records. For some recent bibliography see Richard Cox, “readings in Archives and 
Electronic Records; Annotated Bibliography and Analysis of the Literature,” in Archives and Museum Informatics 
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Technical Report No. 18 (Pimsburgh: Archives and Museurn Informatics, 1993), pp. 99-156; and Lucians Duranti, 
“The Concape of Appraisal and Archival Theory,” American Archivist 57/2 (Spring: 1994), pp. 328-44; and Terry 
Eastwood, “How Goes Jt with Appraisal?” Archivaria 36 (Autumn 1993), pp. 111-121. 

5 The title of this paper uses the term “information” and I will use that term and the term documentation to include 
publications, databasas, documents, information systems, and records. If] am discussing specific typc of 


© Margaret Hedstrom, “Descriptive Practices for Blectronic Records: Deciding What is Essential and Imagining 


Museum Informatics, 1994), I do want to make clear that I coasidcr electronic records to be a small subset of all 
electronic information and subject to more rigid control and management. See Richard Cox, ed. University of 
P Recordkeeping Functional Requirements Project Renorts and Working Papers — Proeress Repo Twa 
(Picsburgh: Univerdity of Pittsburgh School of Library and Infarmarfon Science, 1995). 

® [use the term “creating organization” to mean the entity responsible for creating the information product. That 
entity many be a person, organization, business, etc. 
11 In some cases, the same data may appear in one place as a final product and in another as source materials, as in 
tbe casa of demographic deta provided by the Census Burean 2s a final product wh{ch is used by a real estare firm as 
sourre data in its GIS to study housing patterns, 
12 Seminal work on the use of e-mail in research was published by Avra Michelson and Jeff Rothesnberg, 
“Scholarly Communication atid Information Technology: Exploring the impact of Changes in Research Process on 
Archives," American Archivist 55 (Spring 1992): 236-315, On the potential research uses of electronic 
commounications in a university environments see Anne J Giliilend-Swetland and Carol Huehes, “Enhancing 
Archival Description for Public Computer Conference of Historical Value: An Expleratory Study, American 
Archivist 55 (Spring 1992), pp. 3160330. 
"4 good overview of the current discussion is found in Angelika Menne-Haritz, “Appraisal or Documentation: 
Can We Appraise Archives by Selecting Content?” in American Archivist 57/3 (Gammer 1594), pp. 528-542. 
44 This same question lies behind the term “essential evidence,” a formulation far value used in the National 
Archives and Records Adminiswation’s strategic plan. The question is “essential to whom.” 
15 4 fallacy in this argument is thet many of the smoking guns that are found are the result of information not 
being properly managed. In many cases, the records survived due to stupidity on the part of the creator. or because 
the records were ignored for may years. Electronic records will not survive due to inattention over time. 
16 This follows the line of thinking generally known as the “records contdiduym.” For a recent overview of the 
Idca see Jay Atherton, Corporate Memory and tha Records Continuum,” unpublished paper presented at the 1996 
Annual Meeting of the Society of American Archivist. 
37 This type of reasoning is behind the traditional records management practice of dividing the documentation of 
an activity imo the “official file” which would have longer retention, and the “working file” thar would have a 
shorter retention. 
“Isce generally, Terry Eastwood, “How Goes It with Appraisal?” Aychivarja 36 (Autumn 1993), pp. 113-12), 
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